An Optimization for MapReduce Frameworks in Multi-core Architectures

Authors

  • Tharso Ferreira
  • Antonio Espinosa
  • Juan C. Moure
  • Porfidio Hernández
Abstract

MapReduce simplifies parallel programming by abstracting away programmer responsibilities such as synchronization and task management. The paradigm allows the programmer to write sequential code that is automatically parallelized. Current MapReduce frameworks are designed for situations in which all keys generated by the Map phase fit into main memory. However, certain types of workload have a key distribution that causes the intermediate data structures to grow beyond the available main memory. Based on the behavior of MapReduce frameworks on multi-core architectures for these workloads, we propose an extension of the original MapReduce strategy for multi-core architectures. The extension spans two levels of the memory hierarchy, main memory and hard disk, and aims to reduce main memory usage as well as the page faults caused by swapping. Its main goal is to ensure acceptable MapReduce performance when the intermediate data structures do not fit in main memory and secondary storage must be used.
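The abstract does not describe the mechanism in detail. As a rough illustration of the general idea only, and not the authors' implementation, the following single-threaded Python sketch counts words while bounding the number of distinct keys held in memory: when the in-memory table reaches a limit, it is written to disk as a sorted run, and the Reduce step merges the runs sequentially. The names `word_count_with_spill`, `max_keys_in_memory`, `_spill`, and `_read_run` are hypothetical, and the key-count threshold is an assumed stand-in for a real memory budget.

```python
import heapq
import os
import pickle
import tempfile
from collections import defaultdict
from itertools import groupby
from operator import itemgetter


def _spill(table, spill_dir):
    """Write one sorted run of (key, partial_count) pairs to disk."""
    fd, path = tempfile.mkstemp(dir=spill_dir, suffix=".run")
    with os.fdopen(fd, "wb") as f:
        for item in sorted(table.items()):
            pickle.dump(item, f)
    return path


def _read_run(path):
    """Stream (key, partial_count) pairs back from a spilled run."""
    with open(path, "rb") as f:
        while True:
            try:
                yield pickle.load(f)
            except EOFError:
                return


def word_count_with_spill(lines, max_keys_in_memory=1000):
    """Count words while holding at most max_keys_in_memory keys in RAM.

    Hypothetical sketch: the threshold is a number of distinct keys,
    an assumed proxy for a real memory budget.
    """
    with tempfile.TemporaryDirectory() as spill_dir:
        table = defaultdict(int)
        runs = []
        # Map phase with in-memory combining of partial counts.
        for line in lines:
            for word in line.split():
                table[word] += 1
                if len(table) >= max_keys_in_memory:
                    # Table is full: move it to secondary storage.
                    runs.append(_spill(table, spill_dir))
                    table.clear()
        if table:
            runs.append(_spill(table, spill_dir))
        # Reduce phase: k-way merge of the sorted runs, read sequentially.
        merged = heapq.merge(*(_read_run(path) for path in runs))
        result = {}
        for key, group in groupby(merged, key=itemgetter(0)):
            result[key] = sum(count for _, count in group)
        return result
```

Because each run is sorted by key, the merge reads every file sequentially and only one key group is aggregated at a time, which is the kind of access pattern that avoids the random page faults of swap. For example, `word_count_with_spill(["a b a", "c b a"], max_keys_in_memory=2)` spills three small runs and still returns the full counts.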

Similar resources

Optimizing the use of the Hard Disk in MapReduce Frameworks for Multi-core Architectures

MapReduce simplifies parallel programming, abstracting the responsibility of the programmer, such as synchronization and task management. The paradigm allows the programmer to write sequential code that is automatically parallelized. The MapReduce frameworks developed for multi-core architectures provide large processing keys which consequently g...

Building a Java MapReduce Framework for Multi-core Architectures

MapReduce is a programming pattern that has proved to be a simple abstraction on top of which an efficient platform for large-scale data processing in distributed environments, such as Google or Hadoop, can be built. With this pattern, application logic is expressed using sequential map and reduce functions. Thus, a runtime system can exploit the lack of side effects (pure functions) in thes...

Design of a novel congestion-aware communication mechanism for wireless NoC architecture in multicore systems

Hybrid Wireless Network-on-Chip (WNoC) architecture has emerged as a scalable communication structure to mitigate the deficits of traditional NoC architecture for future multi-core systems. The hybrid WNoC architecture provides energy-efficient, high-data-rate, and flexible communications for NoC architectures. In these architectures, each wireless router is shared by a set of processing core...

Parallel Programming of Cellular Automata on Multi-core and Many-core Computers

Cellular Automata (CA) models are important for many applications in physics, chemistry, biology and engineering. Thus, programming such CA models to run as parallel applications on modern computer architectures is of great interest. The approach advocated and studied in this paper is to consider CA models as intermediate formalisms that can be mapped into other (better or well-researched) comp...

Bitonic-MapReduce: Optimization of MapReduce on the Cell B.E. Architecture with a Bitonic Sort Senior Honors Thesis

The Cell B.E. Architecture is a novel, heterogeneous, multi-core architecture that offers opportunities for significant performance. However, a lack of programmer familiarity with explicitly parallelizing code and difficulty using its unique software-managed memory model make writing programs for the Cell difficult, even for experienced programmers. However, if tools can be made to abstract awa...

Publication date: 2013